Audio-Video Based Segmentation and Classification using AANN
نویسنده
چکیده
This paper presents a method to classify audio-video data into one of seven classes: advertisement, cartoon, news, movie, and songs. Automatic audio-video classification is very useful to audio-video indexing, content based audio-video retrieval. Mel frequency cepstral coefficients are used to characterize the audio data. The color histogram features extracted from the images in the video clips are used as visual features. Auto associative neural network is used for audio and video segmentation and classification. The experiments on different genres illustrate the results of segmentation and classifications are significant and effective. Experimental results of audio classification and video segmentation and classification results are combined using weighted sum rule for audio-video based classification. The method classifies the audio-video clips with effective and efficient results obtained.
منابع مشابه
Audio-video based Segmentation and Classification using SVM and AANN
In this paper, we propose a method for combining audio and video for segmentation and classification. The objective of segmentation is to detect the category change point such news to advertisement. The classification system classify the audio-video data into one of the predefined categories such as news, advertisement, sports, serial and movies. Mel frequency cepstral coefficients( MFCC) are u...
متن کاملAudio-Video based Classification using SVM and AANN
This paper presents a method to classify audio-video data into one of five classes: advertisement, cartoon, news, movie and songs. Automatic audio-video classification is very useful to audio-video indexing, content based audio-video retrieval. Mel frequency cepstral coefficients are used to characterize the audio data. The color histogram features extracted from the images in the video clips a...
متن کاملVideo Classification and Shot Detection for Video Retrieval Applications
Appropriate organization of video databases is essential for pertinent indexing and retrieval of visual information. This paper proposes a new feature called Block Intensity Comparison Code (BICC) for video classification and an unsupervised shot change detection algorithm to detect the shot changes in a video stream using autoassociative neural network (AANN) which makes retrieval problems muc...
متن کاملUnsupervised Speaker Segmentation using Autoassociative Neural Network
In this paper we propose an unsupervised approach to speaker segmentation using autoassociative neural network (AANN). Speaker segmentation aims at finding speaker change points in a speech signal which is an important preprocessing step to audio indexing, spoken document retrieval and multi speaker diarization. The method extracts the speaker specific information from the Mel frequency cepstra...
متن کاملCombining Audio-Based and Video-Based Shot Classification Systems for News Videos Segmentation
In this paper we propose an innovative combination strategy for a system using video and audio stream of a news video to automatically segment it into stories. In our approach, the segmentation is performed in two steps: first, shots are classified by combining three different anchor shot detection algorithms using video information only. Then, the shot classification is improved by using a nov...
متن کامل